Abstract

We present a formula that determines the optimal number of qubits per
message that allows asymptotically faithful compression of the quantum
information carried by an ensemble of mixed states. The set of mixed states
determines a decomposition of the Hilbert space into the redundant part and
the irreducible part. After removing the redundancy, the optimal compression
rate is shown to be given by the von Neumann entropy of the reduced
ensemble.
Consider a source that generates a message $i$ with probability $p_i$. Sequences of
messages independently drawn from this source can be compressed into sequences of bits
and decompressed back to the original sequences of messages. The necessary and sufficient
number of bits per message allowing asymptotically faithful compression and decompression
is given by the Shannon entropy $S = -\sum_i p_i \log_2 p_i$. This result, called the noiseless
coding theorem, is one of the core results of classical information theory. The quantum
analogue of this theorem, which will naturally form a basis of quantum information theory,
was first considered by Schumacher. In this quantum data compression, the source emits a
system in a quantum state $\rho_i$ with probability $p_i$, and sequences of the systems emitted
from this source are assumed to be compressed into qubits. It was shown that when all $\rho_i$
are pure, the least number of qubits allowing asymptotically faithful recovery of the original
states is given by the von Neumann entropy $S(\rho) = -\mathrm{Tr}\,\rho \log_2 \rho$ of the density
operator of the ensemble, $\rho = \sum_i p_i \rho_i$. When $\{\rho_i\}$ includes mixed states, the
problem is still open. Since compression schemes applicable to the pure-state signals can also
be successfully used for the mixed-state cases, the optimal compression rate $I_p$ is bounded
from above by the von Neumann entropy, namely, $I_p \le S(\rho)$. It has also been proved that
the Levitin-Holevo function, $I_{LH} = S(\rho) - \sum_i p_i S(\rho_i)$, is a lower bound for $I_p$,
namely, $I_{LH} \le I_p$.
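As a numerical illustration of the two known bounds, $I_{LH} \le I_p \le S(\rho)$, the following minimal sketch evaluates both sides for a small example. The ensemble (two non-commuting mixed qubit states with equal weights) and all function names are hypothetical choices made here for illustration; the states are restricted to real symmetric $2 \times 2$ matrices so that the spectrum has a closed form.

```python
import math

def shannon_entropy(probs):
    """Shannon entropy in bits of a probability vector (0 log 0 := 0)."""
    return -sum(p * math.log2(p) for p in probs if p > 0)

def eigenvalues_2x2(m):
    """Eigenvalues of a real symmetric 2x2 matrix ((a, b), (b, c))."""
    (a, b), (_, c) = m
    mean = (a + c) / 2
    radius = math.hypot((a - c) / 2, b)
    return [mean + radius, mean - radius]

def von_neumann_entropy(rho):
    """S(rho) = -Tr rho log2 rho, computed from the spectrum of rho."""
    return shannon_entropy(eigenvalues_2x2(rho))

def mix(probs, states):
    """Ensemble average rho = sum_i p_i rho_i for 2x2 matrices."""
    return tuple(
        tuple(sum(p * s[r][c] for p, s in zip(probs, states)) for c in range(2))
        for r in range(2)
    )

# Hypothetical ensemble (an assumption, not taken from the Letter).
probs = [0.5, 0.5]
states = [((0.9, 0.0), (0.0, 0.1)), ((0.5, 0.4), (0.4, 0.5))]

rho = mix(probs, states)
upper = von_neumann_entropy(rho)  # upper bound S(rho)
lower = upper - sum(p * von_neumann_entropy(s) for p, s in zip(probs, states))  # I_LH
```

For this ensemble the two bounds do not coincide, which is the gap that the optimal rate $I_p$ has to be located in.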

The aim of this Letter is to identify the optimal compression rate for the mixed-state
ensemble $\mathcal{E} = \{p_i, \rho_i\}$. We first introduce a function $I_R(\mathcal{E})$ that is given
as the von Neumann entropy of a reduced ensemble $\mathcal{E}_R = \{p_i, \sigma_i\}$. The ensemble
$\mathcal{E}_R$ is derived from $\mathcal{E}$ by stripping off the redundant parts. Then we prove that
$I_R(\mathcal{E})$ is equal to the optimal compression rate $I_p(\mathcal{E})$.

The problem considered here is formulated as follows. Suppose that the source produces
the ensemble $\mathcal{E} = \{p_i, \rho_i\}$, namely, it emits a system in a quantum state $\rho_i$
with probability $p_i$. Using this source $N$ times, we obtain a state
$\rho_\lambda = \rho_{i_1} \otimes \cdots \otimes \rho_{i_N}$ acting on a Hilbert space
$\mathcal{H}^{(N)} = \mathcal{H}_1 \otimes \cdots \otimes \mathcal{H}_N$ with probability
$p_\lambda = p_{i_1} \cdots p_{i_N}$, where $\lambda$ represents a set of indexes
$\{i_1, \ldots, i_N\}$. We assume that the dimension $d$ of each space $\mathcal{H}_n$ is finite.
Now $\mathcal{H}^{(N)}$ is given to Alice, who compresses the signal $\rho_\lambda$ into
$\tilde{\rho}_\lambda$ acting on a Hilbert space $\mathcal{H}_C$ with a dimension usually smaller
than $d^N$. This process is generally written as a quantum operation (linear completely
positive trace-preserving map) $\rho_\lambda \to \tilde{\rho}_\lambda = \Lambda_A(\rho_\lambda)$. The
operation $\Lambda_A$ is independent of $\lambda$ since only the systems $\mathcal{H}^{(N)}$ are given
to Alice and no additional information on $\lambda$ is available. The coded signal
$\tilde{\rho}_\lambda$ is passed on to Bob through a noiseless channel, and he decompresses the
signal by a quantum operation $\tilde{\rho}_\lambda \to \rho'_\lambda = \Lambda_B(\tilde{\rho}_\lambda)$,
where $\rho'_\lambda$ acts on $\mathcal{H}^{(N)}$. To measure the quality of the whole process
$\rho_\lambda \to \rho'_\lambda$, we use the fidelity $F$ given by
$F(\rho, \sigma) = [\mathrm{Tr}\sqrt{\sqrt{\rho}\,\sigma\sqrt{\rho}}]^2$. The quality of a compression
scheme specified by $(\Lambda_A, \Lambda_B)$ for the ensemble $\mathcal{E}$ is given by the average
fidelity

$$\bar{F} = \sum_\lambda p_\lambda F(\rho_\lambda, \rho'_\lambda). \tag{1}$$
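The average fidelity of Eq. (1) can be evaluated directly in small cases. The sketch below is an illustration under stated assumptions: it treats $N = 1$, restricts to real symmetric single-qubit density matrices, and uses the qubit identity $F(\rho, \sigma) = \mathrm{Tr}(\rho\sigma) + 2\sqrt{\det\rho\,\det\sigma}$, which coincides with the definition above in dimension 2. The "scheme" examined is a hypothetical one that discards the input and always outputs the ensemble average; it is not a scheme from this Letter.

```python
import math

def det2(m):
    """Determinant of a 2x2 matrix."""
    return m[0][0] * m[1][1] - m[0][1] * m[1][0]

def fidelity_qubit(rho, sigma):
    """Fidelity of two real symmetric 2x2 density matrices.

    Uses F = Tr(rho sigma) + 2 sqrt(det rho * det sigma), valid for qubits.
    """
    trace_prod = sum(rho[r][c] * sigma[c][r] for r in range(2) for c in range(2))
    # max(..., 0.0) guards against tiny negative products from rounding.
    return trace_prod + 2 * math.sqrt(max(det2(rho) * det2(sigma), 0.0))

def average_fidelity(probs, inputs, outputs):
    """Average fidelity: sum over messages of p * F(input, output)."""
    return sum(p * fidelity_qubit(a, b) for p, a, b in zip(probs, inputs, outputs))

# Hypothetical ensemble and a trivial scheme that ignores its input.
probs = [0.5, 0.5]
states = [((0.9, 0.0), (0.0, 0.1)), ((0.5, 0.4), (0.4, 0.5))]
rho_avg = ((0.7, 0.2), (0.2, 0.3))  # (states[0] + states[1]) / 2

f_bar = average_fidelity(probs, states, [rho_avg, rho_avg])
```

Because this scheme sends nothing, its average fidelity stays strictly below 1, whereas a faithful scheme would return each input unchanged and reach $\bar{F} = 1$.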

Now, for a fixed source $\mathcal{E}$, consider a sequence of compression schemes
$(\Lambda_A^{(N)}, \Lambda_B^{(N)})$ with increasing $N$. When $\lim_{N\to\infty} \bar{F} = 1$, the
sequence gives asymptotically faithful compression of $\mathcal{E}$. Such sequences are called
protocols. For a given protocol $P$, the quantity $R(P)$ characterizing the asymptotic degree
of compression is defined through the size of $\mathcal{H}_C$ measured in the number of qubits,
namely,

$$R(P) = \lim_{N\to\infty} (\log_2 \dim \mathcal{H}_C)/N. \tag{2}$$
Then, the optimal compression rate $I_p(\mathcal{E})$ for the ensemble $\mathcal{E}$ is formally
defined as

$$I_p(\mathcal{E}) \equiv \inf_P R(P). \tag{3}$$

This means that for arbitrarily small $\delta > 0$, asymptotically faithful compression is
possible if $I_p + \delta$ qubits per message are given, and it is impossible if $I_p - \delta$
qubits per message are given.

A useful tool for stripping off the redundant part in $\mathcal{E}$ and deriving the formula for
$I_p(\mathcal{E})$ below is the theory that characterizes the quantum operations that preserve a
set of states $\{\rho_i\}$ (map $\rho_i$ to $\rho_i$) acting on a Hilbert space $\mathcal{H}$. To state
the results of this theory, it is convenient to express quantum operations in unitary
representation, namely, by unitary operations $U$ acting on the combined space
$\mathcal{H} \otimes \mathcal{H}_E$, where $\mathcal{H}_E$ represents an auxiliary system initially prepared
in a standard pure state $\Sigma_E$. Then, it was shown that, given $\{\rho_i\}$, we can find a
decomposition of $\mathcal{H}_A$, defined as the support of $\sum_i \rho_i$ ($\mathcal{H}_A$ is generally a
subspace of $\mathcal{H}$), written as

$$\mathcal{H}_A = \bigoplus_l \mathcal{H}_J^{(l)} \otimes \mathcal{H}_K^{(l)}, \tag{4}$$

in such a way that any $U$ preserving $\{\rho_i\}$ is expressed in the following form

$$U(\mathbf{1}_A \otimes \Sigma_E) = \bigoplus_l \mathbf{1}_J^{(l)} \otimes U_{KE}^{(l)}(\mathbf{1}_K^{(l)} \otimes \Sigma_E), \tag{5}$$

where $U_{KE}^{(l)}$ are unitary operators acting on the combined space
$\mathcal{H}_K^{(l)} \otimes \mathcal{H}_E$. Under this decomposition, $\rho_i$ is written as

$$\rho_i = \bigoplus_l q^{(i,l)} \rho_J^{(i,l)} \otimes \rho_K^{(l)}, \tag{6}$$

where $\rho_J^{(i,l)}$ and $\rho_K^{(l)}$ are normalized density operators acting on $\mathcal{H}_J^{(l)}$
and $\mathcal{H}_K^{(l)}$, respectively, and $q^{(i,l)}$ is the probability for the state to be in the
subspace $\mathcal{H}_J^{(l)} \otimes \mathcal{H}_K^{(l)}$. The state $\rho_K^{(l)}$ is independent of $i$, and
$\{\rho_J^{(1,l)}, \rho_J^{(2,l)}, \ldots\}$ cannot be expressed in a simultaneously
block-diagonalized form. An explicit procedure to obtain this particular decomposition is
also given by that theory.

The form of Eq. (6) implies that the spaces $\mathcal{H}_K^{(l)}$ are redundant in the ensemble
$\mathcal{E} = \{p_i, \rho_i\}$. Consider the states $\sigma_i = \bigoplus_l q^{(i,l)} \rho_J^{(i,l)}$, from
which the redundancy has been removed, and let $\mathcal{E}_R = \{p_i, \sigma_i\}$ be the corresponding
ensemble. The von Neumann entropy of $\mathcal{E}_R$ can be regarded as a function of the ensemble
$\mathcal{E}$, denoted as $I_R(\mathcal{E})$, since the decomposition (4) is determined by the set
$\{\rho_i\}$. What we prove below is that the optimal compression rate $I_p(\mathcal{E})$ is given by
the function $I_R(\mathcal{E})$.
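The effect of stripping off the redundant part can be seen in the simplest case of a single block. The sketch below assumes a hypothetical ensemble of the form $\rho_i = \sigma_i \otimes \tau$, where $\tau$ does not depend on $i$ and plays the role of $\rho_K$, and where the two qubit states $\sigma_i$ do not commute and are therefore assumed to be irreducible. Under these assumptions the reduced ensemble is $\{p_i, \sigma_i\}$, and the spectrum of $\rho = \sigma \otimes \tau$ is the set of products of the eigenvalues of $\sigma$ and $\tau$. All names and numbers are illustrative choices, not taken from the Letter.

```python
import math

def entropy_bits(spectrum):
    """Entropy in bits of a list of eigenvalues (0 log 0 := 0)."""
    return -sum(x * math.log2(x) for x in spectrum if x > 0)

def spectrum_2x2(m):
    """Eigenvalues of a real symmetric 2x2 matrix ((a, b), (b, c))."""
    (a, b), (_, c) = m
    mean = (a + c) / 2
    radius = math.hypot((a - c) / 2, b)
    return [mean + radius, mean - radius]

# Hypothetical irreducible parts sigma_i (non-commuting) with equal weights.
probs = [0.5, 0.5]
sigmas = [((0.9, 0.0), (0.0, 0.1)), ((0.5, 0.4), (0.4, 0.5))]
# Hypothetical redundant part tau, identical for every message.
tau_spectrum = [0.8, 0.2]

sigma_avg = tuple(
    tuple(sum(p * s[r][c] for p, s in zip(probs, sigmas)) for c in range(2))
    for r in range(2)
)
sigma_spectrum = spectrum_2x2(sigma_avg)
# rho = sigma (x) tau, so its eigenvalues are all pairwise products.
rho_spectrum = [a * b for a in sigma_spectrum for b in tau_spectrum]

i_r = entropy_bits(sigma_spectrum)  # I_R(E) = S(sigma)
s_rho = entropy_bits(rho_spectrum)  # S(rho), the pure-state-style upper bound
saving = s_rho - i_r                # equals S(tau) for this product form
```

In this example the gap between $S(\rho)$ and $I_R(\mathcal{E})$ is exactly $S(\tau)$, the entropy carried by the part that is identical for every message.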

We begin the proof by noting that the two ensembles $\mathcal{E}$ and $\mathcal{E}_R$ are completely
interchangeable, namely, there exist quantum operations $\Lambda_{\sigma\rho}$ and
$\Lambda_{\rho\sigma}$ that satisfy $\Lambda_{\sigma\rho}(\rho_\lambda) = \sigma_\lambda$ and
$\Lambda_{\rho\sigma}(\sigma_\lambda) = \rho_\lambda$. If a compression scheme $(\Lambda_A, \Lambda_B)$ for
$\rho_\lambda$ is given, we can compose a compression scheme
$(\Lambda_A \Lambda_{\rho\sigma}, \Lambda_{\sigma\rho} \Lambda_B)$ for $\sigma_\lambda$. Since the fidelity does
not decrease under a quantum operation [10, 11], we have the inequality
$F(\rho_\lambda, \Lambda_B \Lambda_A(\rho_\lambda)) \le F(\sigma_\lambda, \Lambda_{\sigma\rho} \Lambda_B \Lambda_A \Lambda_{\rho\sigma}(\sigma_\lambda))$.
Hence the composed scheme always has a better or equal average fidelity. This implies that
if a protocol for $\mathcal{E}$ with an asymptotic degree of compression $R$ is given, we can compose
a protocol for $\mathcal{E}_R$ with the same degree $R$. Consequently, we have
$I_p(\mathcal{E}) \ge I_p(\mathcal{E}_R)$. Since a similar argument can be made with $\rho$ and $\sigma$
interchanged, we obtain the equality

$$I_p(\mathcal{E}) = I_p(\mathcal{E}_R). \tag{7}$$



Now it suffices to consider the cases where $\{\rho_i\}$ have no redundancy, namely,
$\sigma_i = \rho_i$ and $\mathcal{E}_R = \mathcal{E}$, and we will prove the relation
$I_p(\mathcal{E}) = S(\rho)$ in these cases. Since we already have the inequality
$I_p(\mathcal{E}) \le S(\rho)$, what we need is the opposite inequality, $I_p(\mathcal{E}) \ge S(\rho)$.
We will give a sketch of the proof first.

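In a compression-decompression scheme $(\Lambda_A, \Lambda_B)$, the state eventually evolves as
$\rho_\lambda \to \rho'_\lambda = \Lambda(\rho_\lambda)$, where $\Lambda = \Lambda_B \Lambda_A$. In this process,
the marginal state in the first system ($\mathcal{H}_1$) evolves from $\rho_{i_1}$ to
$\mathrm{Tr}_{2 \ldots N}(\rho'_\lambda)$. This evolution can be regarded as a result of a quantum
operation $\Lambda_1$, defined as

$$\Lambda_1(\rho_i) = \sum_{i_2, \ldots, i_N} p_{i_2} \cdots p_{i_N} \mathrm{Tr}_{2 \ldots N} \Lambda(\rho_i \otimes \rho_{i_2} \otimes \cdots \otimes \rho_{i_N})
= \mathrm{Tr}_{2 \ldots N} \Lambda(\rho_i \otimes \rho \otimes \cdots \otimes \rho). \tag{8}$$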

Note that $\Lambda_1$ is determined by $\Lambda$ and the total density operators ($\rho$) of the
initial state ensembles of the other $N - 1$ systems. In a protocol, a scheme
$(\Lambda_A, \Lambda_B)$ for large $N$ is nearly perfect. For this scheme, $\Lambda_1$ will almost
preserve the states $\{\rho_i\}$. The decomposition (4) for $\{\rho_i\}$ satisfying
$\rho_i = \sigma_i$ can be simplified as $\mathcal{H}_A = \bigoplus_l \mathcal{H}_J^{(l)}$ since
$\mathcal{H}_K^{(l)}$ is a one-dimensional space. Correspondingly, the requirement for preserving
$\{\rho_i\}$ can be written as

$$U(\mathbf{1}_A \otimes \Sigma_E) = \bigoplus_l \mathbf{1}_J^{(l)} \otimes u_E^{(l)} \Sigma_E, \tag{9}$$

where $u_E^{(l)}$ are unitary operators acting on $\mathcal{H}_E$. The operation $\Lambda_1$, which
nearly preserves $\{\rho_i\}$, should thus be approximately written in the form (9). Next, take
a diagonalization of the total density operator,
$\rho = \sum_{l,s} p_{l,s} |l,s\rangle\langle l,s|$, in such a way that for a fixed $l$, the set
$\{|l,s\rangle\}$ forms a basis of $\mathcal{H}_J^{(l)}$. Let us consider an ensemble
$\mathcal{E}_\perp = \{p_{l,s}, \rho_{l,s} = |l,s\rangle\langle l,s|\}$ composed of orthogonal pure
states. If we replace the source $\mathcal{E}$ by $\mathcal{E}_\perp$ in the scheme
$(\Lambda_A, \Lambda_B)$, the operation $\Lambda_1$ does not change because the total density operator
is identical for the two ensembles. Then, the error in the transmission of $|l,s\rangle$ will
be small since the operation of the form (9) preserves $\{|l,s\rangle\}$. This means that by a
projection measurement in the basis $\{|l,s\rangle\}$, classical information close to
$N S(\rho)$ bits can be sent through the channel $\mathcal{H}_C$. This implies
$\log_2 \dim \mathcal{H}_C \gtrsim N S(\rho)$. Combined with the definitions (2) and (3), we have
$I_p(\mathcal{E}) \ge S(\rho)$.

The strict proof is given by clarifying the meaning of 'nearly' in the above sketch, by
introducing several measures ($f$ and $g$ below) characterizing the nearness. In unitary
representation, any quantum operation for the system $\mathcal{H}_1$ can be represented by a
unitary operator $U$ in $d \times d^2 = n$ dimensions, acting on the combined space of
$\mathcal{H}_1$ and an auxiliary system $\mathcal{H}_E$ with dimension $d^2$. Let us introduce two
nonnegative continuous functions $f, g : U(n) \to \mathbb{R}$ that measure how close
$U \in U(n)$ is to the form (9). The first one is defined as
$f(U) = 1 - \sum_i p_i F(\rho_i, \Lambda_U(\rho_i))$, where
$\Lambda_U(\rho_i) = \mathrm{Tr}_E[U(\rho_i \otimes \Sigma_E)U^\dagger]$. Since $f(U) = 0$ iff
$\Lambda_U(\rho_i) = \rho_i$ for all $i$, $f^{-1}(0)$ is equal to the set of $U$ that can be expressed
in the form (9). The other measure is related to the average error probability of the
transmission of $\mathcal{E}_\perp$, defined as
$p_e = 1 - \sum_{l,s} p_{l,s} \mathrm{Tr}(\rho_{l,s} \Lambda_U(\rho_{l,s}))$. For later convenience, we
use the function $g(U)$ defined through $p_e$, namely,
$g(U) = H(p_e) + p_e \log_2(d - 1)$ with $H(p) = -p \log_2 p - (1 - p) \log_2(1 - p)$.
Since the form (9) preserves $\{\rho_{l,s}\}$, $g(U)$ is zero for any $U \in f^{-1}(0)$. An important
relation between the two measures is that if $g$ is away from zero, $f$ must also be away from
zero. This is proved as follows. Let us define the set $X_\delta = \{U \mid g(U) \ge \delta\}$ for
arbitrary $\delta > 0$. Since $g$ is continuous, $X_\delta$ is a closed subset of $U(n)$. Since
$U(n)$ is compact and $f$ is continuous, the image $f(X_\delta)$ is closed in $\mathbb{R}$. The
relation $X_\delta \cap f^{-1}(0) = \emptyset$ implies that $0 \notin f(X_\delta)$. Therefore,
$f(X_\delta)$ has its minimum $\eta(\delta) > 0$. This result will be used to derive the
inequality (11) below. Note that the functional dependence of $\eta$ on $\delta$ is determined
by $\mathcal{E}$, and is independent of $N$.
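The function $g$ depends on $U$ only through the error probability $p_e$, and it is the quantity that appears on the right-hand side of Fano's inequality. As a hedged illustration, the sketch below evaluates $H(p_e) + p_e \log_2(d - 1)$ and compares it with the conditional entropy $H(X|Y)$ of a hypothetical symmetric classical channel (uniform input on $d$ symbols, errors spread uniformly over the $d - 1$ wrong symbols), for which Fano's inequality holds with equality. The channel and the function names are assumptions introduced here for illustration.

```python
import math

def binary_entropy(p):
    """H(p) = -p log2 p - (1 - p) log2(1 - p), with H(0) = H(1) = 0."""
    if p <= 0.0 or p >= 1.0:
        return 0.0
    return -p * math.log2(p) - (1 - p) * math.log2(1 - p)

def fano_g(p_e, d):
    """g = H(p_e) + p_e log2(d - 1), for alphabet size d >= 2."""
    return binary_entropy(p_e) + p_e * math.log2(d - 1)

def conditional_entropy_symmetric(p_e, d):
    """H(X|Y) in bits for uniform X sent through a symmetric channel.

    The symbol arrives intact with probability 1 - p_e; otherwise it is
    replaced by one of the d - 1 other symbols, chosen uniformly.
    """
    h = 0.0
    for x in range(d):
        for y in range(d):
            # By symmetry P(y) = 1/d, so P(x|y) equals the transition probability.
            cond = (1 - p_e) if x == y else p_e / (d - 1)
            joint = cond / d
            if joint > 0:
                h -= joint * math.log2(cond)
    return h
```

For example, `fano_g(0.1, 4)` and `conditional_entropy_symmetric(0.1, 4)` agree up to rounding, and $g$ vanishes only at $p_e = 0$, which is the property used to separate $X_\delta$ from $f^{-1}(0)$.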

Next, we consider the transmission of classical variable $\{(l, s)\}$ through the source
$\mathcal{E}_\perp$ and the scheme $(\Lambda_A, \Lambda_B)$. Let $X_k$ ($k = 1, \ldots, N$) be independent
random (vector) variables with $\Pr\{X_k = (l, s)\} = p_{l,s}$, and
$X = \{X_1, \ldots, X_N\}$. Suppose that the value of $X_k$ is encoded to the state
$|l,s\rangle$ in the system $\mathcal{H}_k$, the compression-decompression scheme is applied to the
combined system $\mathcal{H}^{(N)}$, and finally the state in each $\mathcal{H}_k$ is measured by the
projection to the basis $|l,s\rangle$, producing a result $Y_k$. The transmitted data is
represented by $Y = \{Y_1, \ldots, Y_N\}$. The quantum operation $\Lambda_k$ on each system
$\mathcal{H}_k$ can be written in a similar form as (8). Let us take a unitary representation
$U_k \in U(n)$ for $\Lambda_k$. A lower bound for the mutual information
$I(X; Y) = H(X) - H(X|Y)$ in this example is obtained as follows. Since the $X_k$ are
independent, we have $H(X) = \sum_k H(X_k) = N S(\rho)$. From the general properties of
entropy, we obtain the following inequalities [13]: $g(U_k) \ge H(X_k|Y_k)$ (Fano's
inequality), $H(X_k|Y_k) \ge H(X_k|Y)$ (conditioning reduces entropy), and
$\sum_k H(X_k|Y) \ge H(X|Y)$ (independence bound on entropy). Combining these, we have
$I(X; Y) \ge N S(\rho) - \sum_k g(U_k)$. On the other hand, $I(X; Y)$ cannot exceed the
capacity of the channel $\mathcal{H}_C$, namely, $\log_2 \dim \mathcal{H}_C \ge I(X; Y)$. We thus arrive
at the relation

$$\sum_k g(U_k)/N \ge S(\rho) - (\log_2 \dim \mathcal{H}_C)/N. \tag{10}$$
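Relation (10) can be read quantitatively: if the channel falls short of $S(\rho)$ by $\delta$ qubits per message, the average of $g(U_k)$ is at least $\delta$, so at least one system has $g \ge \delta$. Since $H(p_e) + p_e \log_2(d - 1)$ is increasing in $p_e$ on $[0, 1 - 1/d]$, this translates into a lower bound on the error probability of that system. The sketch below inverts $g$ by bisection; the function names and the tolerance are illustrative assumptions, not part of the Letter.

```python
import math

def fano_g(p_e, d):
    """g = H(p_e) + p_e log2(d - 1), for alphabet size d >= 2."""
    if p_e <= 0.0:
        return 0.0
    h = -p_e * math.log2(p_e)
    if p_e < 1.0:
        h -= (1 - p_e) * math.log2(1 - p_e)
    return h + p_e * math.log2(d - 1)

def min_error_probability(delta, d, tol=1e-12):
    """Smallest p_e in [0, 1 - 1/d] with g(p_e) >= delta, found by bisection.

    Relies on g being increasing on this interval, where it rises
    from 0 to log2(d).
    """
    lo, hi = 0.0, 1.0 - 1.0 / d
    if delta > fano_g(hi, d):
        raise ValueError("delta exceeds log2(d); no error probability attains it")
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if fano_g(mid, d) >= delta:
            hi = mid
        else:
            lo = mid
    return hi

# Hypothetical numbers: qubit systems (d = 2) and a deficit of 0.1 qubit per message.
p_min = min_error_probability(0.1, 2)
```

The resulting `p_min` does not depend on $N$, which mirrors the role of $\eta(\delta)$ in the argument: a fixed rate deficit forces an error that does not vanish as $N$ grows.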

Now let us suppose that the number of available qubits per message is smaller than
$S(\rho)$, namely, $(\log_2 \dim \mathcal{H}_C)/N = S(\rho) - \delta$ with $\delta > 0$. Since the
numbering of the systems $\mathcal{H}_k$ is arbitrary, we can generally assume that $g(U_1)$ is not
smaller than any other $g(U_k)$. Then, from the relation (10) we have $g(U_1) \ge \delta$, or
equivalently, $U_1 \in X_\delta$. As shown above, this implies
$f(U_1) = 1 - \sum_i p_i F(\rho_i, \Lambda_1(\rho_i)) \ge \eta(\delta) > 0$. From the properties of
the fidelity function $F$, we obtain

$$\bar{F} = \sum_\lambda p_\lambda F(\rho_\lambda, \Lambda(\rho_\lambda))
\le \sum_\lambda p_\lambda F(\rho_{i_1}, \mathrm{Tr}_{2 \ldots N} \Lambda(\rho_{i_1} \otimes \rho_{i_2} \otimes \cdots \otimes \rho_{i_N}))
\le \sum_i p_i F(\rho_i, \Lambda_1(\rho_i)) \le 1 - \eta(\delta) \tag{11}$$

since the fidelity does not decrease under partial trace (the first inequality) and
$F(\sigma, \rho)$ is concave as a function of $\rho$ (the second). The average fidelity of the
compression-decompression scheme never exceeds $1 - \eta(\delta) < 1$ for any $N$, where
$\eta(\delta)$ is independent of $N$. This means that no protocols exist that satisfy
$R(P) = S(\rho) - \delta$. Hence $I_p(\mathcal{E}) \ge S(\rho)$. Combined with the opposite
inequality $I_p(\mathcal{E}) \le S(\rho)$, we obtain $I_p(\mathcal{E}) = S(\rho)$ for the ensemble
$\mathcal{E}$ satisfying $\mathcal{E} = \mathcal{E}_R$. Together with Eq. (7), we obtain the formula for
general $\mathcal{E}$,

$$I_p(\mathcal{E}) = I_R(\mathcal{E}), \tag{12}$$

which is the main result of this Letter. For convenience, we repeat the definition of the
function $I_R(\mathcal{E})$: From $\mathcal{E} = \{p_i, \rho_i\}$, determine
$\mathcal{E}_R = \{p_i, \sigma_i\}$ through the decomposition (4). Then, $I_R = S(\sigma)$ with
$\sigma = \sum_i p_i \sigma_i$.

The protocols we considered above are asymptotically reversible, namely, Bob is required
to asymptotically reproduce everything that was given to Alice. Bob can thus compress the
reproduced signals again with the same degree of compression. This class of protocols is
called blind protocols, and the obtained bound $I_p$ is called passive information. In
another scenario, not only the system $\mathcal{H}^{(N)}$ but also the identity of the state
$\rho_\lambda$, namely, the index $\lambda$, is disclosed to Alice. Bob still has to decompress the
signal without additional knowledge of $\lambda$. This class of protocols is called visible
protocols, and the corresponding optimal compression rate is called effective information
$I_{\mathrm{eff}}$ [6, 10]. This scheme is irreversible and cannot be repeated, but the
compression rate $I_{\mathrm{eff}}$ may be better than $I_p$. The difference
$I_d = I_p - I_{\mathrm{eff}}$ is called information defect. For an ensemble of pure states, it
was shown that the information defect is zero. While the identity of $I_{\mathrm{eff}}$ is still
an open question, the derived form of $I_p$ assures the presence of nonzero information
defect for an ensemble of mixed states, which can be shown as follows. In the second
scenario, Alice can compress the classical value $\lambda$ into the length of Shannon entropy,
and send it directly to Bob. This indicates $I_{\mathrm{eff}} \le -\sum_i p_i \log_2 p_i$. For
example, if $p_1 = p_2 = 1/2$, then $I_{\mathrm{eff}} \le 1$. On the other hand, by allowing the
dimension $d$ to be large, we can find examples of $\rho_1$ and $\rho_2$ with arbitrarily large
$I_p$, according to the result (12).
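A concrete family showing this behaviour can be sketched with commuting states, where no eigenvalue solver is needed. The example below is a hypothetical choice made for illustration: $\rho_1 = \mathbf{1}/d$ and $\rho_2 = \mathrm{diag}(q_1, \ldots, q_d)$ with $q_j \propto j$, each with probability $1/2$. We assume this pair has no redundant part, on the grounds that no two basis states share the same ratio of weights in $\rho_1$ and $\rho_2$, so that $I_p = S(\rho)$ by the result (12); that assumption is ours and is not verified by the code.

```python
import math

def entropy_bits(probs):
    """Shannon entropy in bits (0 log 0 := 0)."""
    return -sum(p * math.log2(p) for p in probs if p > 0)

def passive_information(d):
    """S(rho) for rho = (rho_1 + rho_2) / 2 in the hypothetical family.

    rho_1 is maximally mixed; rho_2 is diagonal with weights q_j = j / norm.
    Both are diagonal in the same basis, so S(rho) is the Shannon entropy
    of the averaged diagonal.
    """
    norm = d * (d + 1) / 2
    averaged = [0.5 * (1.0 / d) + 0.5 * (j / norm) for j in range(1, d + 1)]
    return entropy_bits(averaged)

# Visible-protocol bound for p_1 = p_2 = 1/2: Alice just sends the index.
shannon_bound = entropy_bits([0.5, 0.5])

rates = {d: passive_information(d) for d in (4, 16, 64)}
```

Under the stated assumption, `rates[d]` grows roughly like $\log_2 d$ while `shannon_bound` stays at one bit, so the information defect $I_d$ becomes arbitrarily large with $d$.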

Finally, we would like to raise several problems which are worthy of future investigation.
What we have proved in this Letter corresponds to the so-called weak converse
of Shannon's noiseless coding theorem, namely, if $I_p - \delta$ qubits are available per
system, the fidelity cannot reach unity in the limit $N \to \infty$. For classical or pure-state
ensembles, the strong converse holds, namely, the fidelity goes to zero when $N \to \infty$.
Whether this statement holds for mixed-state cases or not is an important open question. In
the proof of the main result, we utilized the observation that any protocol for a
mixed-state ensemble $\mathcal{E}$ with no redundancy ($\mathcal{E} = \mathcal{E}_R$) can be used to
transmit the 'purified' ensemble $\mathcal{E}_\perp$ with errors asymptotically negligible per
message. The requirement ($\bar{F} \to 1$) for the compression protocols for $\mathcal{E}_\perp$ is
more stringent, namely, the total errors for the whole $N$ messages must be negligible.
Whether the protocols for $\mathcal{E}$ always work as compression protocols for $\mathcal{E}_\perp$ or
not is another open question.

In summary, we derived the formula for the optimal compression rate (passive information)
for a general mixed-state ensemble $\{p_i, \rho_i\}$. This will give a measure of how much
information is stored in the ensemble of quantum states in terms of qubits. We have also
shown the presence of nonzero information defect, namely, there are cases where knowing
the identity of states gives more efficient compression.